On Non- Complete Sets and Restivo's Conjecture 



Vladimir V. Gusev, Elena V. Pribavkina 

Ural State University, Lenina st. 51, 620083, Ekaterinburg, Russia 
vl .gusevSgmail . com, elena.pribavkina(5usu.ru 



^— ■>) I Abstract. A finite set S of words over the alphabet S is called non- 

. complete if Fact{S*) 7^ S* . A word w € S* \ Fact(S*) is said to be 

uncompletable. We present a series of non-complete sets Sk whose mini- 
^. mal uncompletable words have length 5fc^ — 17fc+ 13, where fc > 4 is the 

maximal length of words in Sk- This is an infinite series of counterex- 
amples to Restivo's conjecture, which states that any non-complete set 
possesses an uncompletable word of length at most 2k . 
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1 Introduction 



^ \ Let S he a. finite alphabet. A finite set S of words over the alphabet E is called 

complete if Fact(S*) = S* , i.e. every word over the alphabet 17 is a factor of a 
word of S* . If 5' is not complete, S* \ Fact{S*) is not empty and a word in this 
set of minimal length is called a minimal uncompletable word (with respect to 
the non-complete set S). Its length will be denoted by uwl{S). 

■ The problem of finding minimal uncompletable words and their length was 
! introduced in 1981 by Restivo. In his paper [S] he conjectured that a non- 

■ complete set 5* always possesses an uncompletable word w of length at most 
I 2k'^, where k is the maximal length of words in S, and w is of the form w = 

■ UV1UV2 ■ ■ • uvk-iu, where u ^ S, \u\ = k and \vi\ < k for all i = 1, 2, . . . , fc — 1. 
An example giving a lower bound k'^ + k ~ 1 for the length of minimal uncom- 
pletable words was presented in However Restivo's conjecture appeared to 
be false by means of a counterexample found in [5]. Namely, let fc > 6 and let 
Rk = S'=\ {a'^-'^bb} U Sba'^-^E U Eba Ub'^U Jk, where Jk = [j'l=!iba'S U a'b). 

^ , In [2] the authors computed for 7 < fc < 12 that the length of a minimal un- 

completable word for Rk is equal to 3fc^ — 9fc + 1 but were unable to prove it in 
general. 

In this paper we present a new series of non-complete sets Sk whose minimal 
uncompletable words have length 5fc^ — 17k + 13 for k > 4. 

As far as the upper bound is concerned, only trivial exponential one is known. 
More precisely, the length of a minimal uncompletable word is at most 2ll'^ll^™+"'^, 
where m is the number of elements in S and \\S\\ is the sum of lengths of all 
elements in S. It comes from the connection between non-complete sets and 
synchronizing automata studied in [3]. However this bound is not likely to be 
precise. 

An interesting related question of deciding whether a given regular language 
L satisfies one of the properties S* = Fact{L), U* = Pref{L), S* = Suff{L) has 
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been recently considered by Rampersad et al. in where the computational 
complexity of the aforementioned problems in case L is represented by a de- 
terministic or non-deterministic finite automaton is studied. In particular case 
L = S* for S being a finite set of words the authors mention that the complexity 
of deciding whether or not S* = Fact{S*) is still an open problem. 

2 The set Sk 

To fix the notation, let us recall some basic definitions from combinatorics on 
words. By we denote the length of a word w. The length of the empty word 
e is equal to zero. By 17+ we denote the set of all non-empty words over the 
alphabet U; by 2J'^ - the set of all words of length k over S and by Z'-'^ - 
the set of all words of length at most k over E. A word u £ 17+ is a factor of 
w (prefix or sujfix respectively) if w can be decomposed as w — xuy [w = uy 
or w = xu respectively) for some x,y S*. A factor (prefix, suffix) w of it; is 
called proper if it ^ w. Given a word u = 0102 • • ■ a„ S 17+ by u[i ■■■,]] with 
1 i ^ 'T- we denote the factor aiOi+i ■ ■ - Oj Hi < j, and the empty word if 
i > j. Moreover, we put u[0] — e. 
Let U — {a, b}. Consider the set 

Sk = (i7'= \ {ba''-\b''-^a}) U {S^-^ \ {a^-\b^-^}) . 

In section [3] we show that this set is not complete for fc > 4 and possesses an 
uncompletable word of length 5fc^ — 17fc-|-13. In section|4]we show that this upper 
bound is precise. Our results considerably rely upon the notion of a forbidden 
position in a word. This notion was introduced in [3]. Let 5* be any non-complete 
set and let w be an uncompletable word for the set S. We say that < j < |w| — 1 
is a forbidden position in w with respect to S, if w[j -I- 1, ... , \w\] ^ Pref{S*), 
i.e. the suffix of the word w starting from position j is not a prefix of any word 
in 5**. If the set S is clear from context we will omit reference to S. Note that, 
if S* C E-^ and positions 0, 1, . . . , fc — 1 are forbidden in some word w, then w 
is uncompletable for S. So to prove that a set 5* C E-^ is not complete, it is 
enough to find a word with first fc forbidden positions. 

Lemma 1. Let j > k be a forbidden position in a word w with respect to Sk- 
Then the position j — k is forbidden in w with respect to Sk iff either j — 1 is 
forbidden or w[j — fc + 1, . . . , j — 1] G {a'"'^^, 6*^"^}. 

Proof. Let j — fc be forbidden in w. Then by definition w[j — k + 1, . . . ,\w\] ^ 
Pref{SJl). Suppose j — 1 is not forbidden in w, i.e. w[j, . . . , \w\] ^ x G Pref{S^). 
If the factor y = ^[j — fc -I- 1, . . . , j — 1] of length fc — 1 is in iS^, then w[j — k + 
1, . . . , |w|] — yx G Pref{S^), which is a contradiction. Hence y e E^~^ \ Sk = 

Conversely, arguing by contradiction suppose j — fc is not forbidden. Then 
since the length of the suffix w[j — k + 1, . . . ,\w\] is at least fc, it can be factorized 
as xy where x € Sk and y G Pref{S^). The case |x| = fc contradicts the condition 
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that j is forbidden, since we get w[j+l, . . . , \w\] = t/ G Pref{S^). Hence \x\ ~ k—1 
and since a; G 5^ we have that x is different both from a''^-^ and b''^^. But then 
position J — 1 is not forbidden, which is a contradiction. 

In the rest of the paper we strictly fix the following notation: u = ba''~^ and 

V = b''~^a. We will consider forbidden positions only in occurrences of u and 

V in w. In each such occurrence for convenience we will enumerate forbidden 
positions locally from to /c — 1. 

Example 1. Consider the following word: 

w = [bWa, ab'^b^aa a !bba . 

{0,1,2} {0,2} {0} 

Using definition and lemma [1] it is easy to calculate the set of its forbidden 
positions with respect to ^3: {0, 1, 2, 4, 5, 7, 10}. There are two occurrences of u 
and two occurrences of w in w (the first occurrence of v overlaps with the second 
occurrence of u). Locally enumerated sets of forbidden positions are: {0, 1, 2} in 
the first occurrence of u, {1} in the second occurrence of u, {0,2} in the first 
occurrence of v, and {0} in the second occurrence of v. Note that, since first 
three positions are forbidden in w, this word is uncompletable for S3. 

Position may be forbidden in an occurrence of u or w. The following state- 
ment gives necessary and sufficient conditions for this to happen. It is an easy 
consequence of lemma [H 

Lemma 2. Position is forbidden in an occurrence of u in a word w iff position 
k — 1 is forbidden in the same occurrence of u. Position is forbidden in any 
occurrence of v in w. 

Two occurrences p,q d {u, v} in a word w are said to be consecutive if they 
either overlap or are the only occurrences from {u,v} in the factor pxq of w. 

Lemma 3. Let p,q £ {u, w} be two consecutive occurrences without overlap in 
a word w, and let pxq be a factor of w with \x\ > 0. Let Fp and Fq be the sets 
of forbidden positions in p and q respectively. Then 

Fp C {j + \x\ mod k\jeFq}U {0}. 

Proof. Consider a forbidden position j in p such that i ^ and consider the 
factor y of length multiple of fc in w from position i in p to some position j in q 
(0 £ J < fc)- This factor is in S'^ , since p and q are consecutive occurrences of 
words from S'^ \ Sk. Thus, if position j ^ Fq, then neither position i is forbidden. 
On the one hand, we have = mod fc, on the other hand \y\ = fc — i -I- |a;| -I- j, 
hence i = j + \x\ mod k. 
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Note that, two words from S'' \ Sk overlap only in case of v and u. More 
precisely, two last letters of v overlap with first two letter of u leading to the 
word 6'^~^a''~^. The following statement can be easily proved using the same 
argument as in the previous lemma. 

Lemma 4. Let v and u be two consecutive overlapping occurrences in a word 
w, and let Fy and be the corresponding sets of forbidden positions. Then 

F, C {j - 2 mod k\j eFu}U {0}. 

Previous lemmas allow us to make the following observation. Let p,q (z {u, v} 
be two consecutive occurrences in w. Then forbidden positions in p except 
are inherited from forbidden positions in q, and position may appear in Fp 
according to lemma [51 In our proofs we will trace backwards forbidden positions 
only in occurrences of words from IJ''\Sk starting from the last one. Besides, the 
number of forbidden positions in consecutive occurrences increases by at most 1. 

3 Upper bound for uwl{Sk) 

In this section we prove that the set Sk is non-complete by presenting an un- 
completable word w of length 5fc^ — 17k + 13 for fc > 4. 

Theorem 1. For fc > 4 the set Sk is not complete and there exists an uncom- 
pletable word of length 5fc^ — 17fc + 13. 

Proof. For clarity by r we denote overlapping occurrences of v and u, i.e. r = 

Consider the word lo — u . Y\ (ra* . 5'°~^~V where . can be re- 

1=1 

placed by any letter of the alphabet S. Let us enumerate occurrences of v 
counting backwards from the last one (the first occurrence of v in this order 
will have number 0), and let F^ C {0, . . . , fc — 1} be the set of forbidden posi- 
tions in the ith occurrence of v. Occurrences of u are counted in the same way 
(but starting from 1 instead of 0) and Fl^ are defined analogously. Note that, 
F° = {0}. Then using previous lemmas it is easy to see that F^ — {1} and 
F^ = {0, fc — 1}. By lemma |3] we have F,^ C {0, fc — 2, fc — 1} and by lemmas [T] 
andllwe get i^^ ^ {0, fc - 2, fc - 1}. Analogously we obtain = {0, fc - 3, fc - 2}. 
Now assume F^' = {0, fc — i — 2, . . . , fc — 2}, 1 < i < fc — 4, and let us show that 
F^^*+^^ = {0, fc — j — 3, . . . , fc — 2}. Applying step by step lemmas [T]|3] we obtain 
the following sets of forbidden positions: 

F2'+i = {0, l,fc-i,...,fc-l}, 

= {0,fc-i-l,...,fc-l}, 
F2^+2 = {o,fc_i-2,...,fc-l}, 
F2^+2 = {0,fc-i-3,...,fc-2}. 

Thus, for i = fc — 3 we have Fy'"^ = {0, 1, . . . , fc — 2}. Hence the set of forbidden 
positions in the last occurrence of u is F^''^ "^'^^ = {0, 1, . . . , fc — 1}, which means 
that the word lo is uncompletable. Its length equals 5fc^ — 17fc -f- 13. 
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4 Lower bound for uwl(Sk) 

First wc prove some nice properties of a minimal uncompletable word in 5*^ . 

Theorem 2. Consider a minimal uncompletable word w. Then u is a prefix of 
w and V is its suffix. 

Proof. The word w has either u or as a factor, otherwise w G Pre/((5fc fliJ'^)*). 
Let w = w'x, where \x\ = k, and suppose x ^ v. Let x = x' z, where z & S. Since 
w is minimal, we conclude that w'x' G Fact{S'^), which means rw' x' = qy, for 
some q & S^. and \y\ < k — 1. If |y| < A; — 1, then yz G Pref{Sk), because all the 
words of length at most fc — 1 are prefixes of some words in Sfe. If |y| = fc — 1, then 
yz = X. li X u, then yz G Sk, and qyz = rw ^ S^. li x = u, then y = ba^~^, 
z = a and we have for instance rwa'^~^ G 5^. In any case we get a contradiction 
with the fact that w is uncompletable. Thus, w has w as a suffix. 

Now we are going to investigate one particular symmetr^property of uncom- 
pletable words for Sk- It is trivial that the mirror image 5 of a non-complete 
set S is again non-complete. Moreover, mirror images of uncompletable words 
w for S are uncompletable for S . The same property holds true for renaming 
morpliism: Lp{a) = b and ip{b) = a. Applying these statements to our set we get 
Tk = % = <p{Sk), where 

Tfe = {S'' \ {a''-\ ab''-^}) U {s''-^ \ {a'^'S 6^"^}) . 

So, if w is an uncompletable word for Sk, then <p{tu) is also uncompletable for 

Sk- As we have already shown, every minimal uncompletable word has w as a 
suffix. From the symmetry property it follows that every such word has u as a 
prefix. 

The suffix V oi a. minimal uncompletable word w has only one forbidden 
position, namely 0, and in the prefix u of w all the positions from to fc — 1 are 
forbidden. Thus, we have to analyze how forbidden positions change from one 
occurrence of a word from E'^ \ Sk to the next one. 

Consider an arbitrary occurrence of a word from S'' \ Sk in w. Let F be the 
set of its forbidden positions. We will make use of the following representation 
of F: 

F = [fl,l,fl,2, - - - , fl.mi, f2,l, - - - , f2,m2- ■ ■ ■ fn,l, - - - , .fn,m„], 

where fij+i = fij + 1 mod k, > and n > 1. Simply speaking, 

we partition the set F into blocks of consecutive (with respect to cyclic order) 
forbidden positions. 

Example 2- Consider the word 6a^", and let F = {0, 1. 2, 5. 6, 8. 10} be the set of 
its forbidden positions with respect to S'n. Then according to our representation 
F= [5, 6; 8; 10, 0,1, 2]. 

Theorem 3. Let p and q be two consecutive occurrences of words from {u, v} 

in a minimal uncompletable word W- Let Fp and Fq be the sets of forbidden 
positions in p and q respectively- If \Fp\ > \Fg\, then one of the following holds 
true: 
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(i) p = q ^ u, Fp = {0, k—j, . . . , fc— 1} and Fq — {1, . . . , j}, where 1 < j < k—2; 

(ii) p — V, q — u, these occurrences overlap, Fp — {0, fc — 1} and Fq = {1}; 
(Hi) p = u, q ^ V, Fp = {0,j - i - I,.. .,k ~ 1}, Fq = {0, . . . ,i, j, . . . , k - 1}, 

where j ^ i + 1 mod k and there are k — 1 —i mod fc letters between these 
occurrences; 

Proof. Let Fq = [/i^i, . . . , . . . ; /„,i, . . . , /„,m„]- First assume that p and q 

do not overlap, so let pxq be the corresponding factor of w. 

Case 1. Let p ~ q ~ u. Consider the case n > 2. Then there exists 1 < i < n 
such that fi^i > 2. If fi,i > 2, then by lemma [T] position fi^i + \x\ ^ Fp. If 
fi^i = 2, then by the same lemma fi^i + |x| £ Fp implies x = x'h^~'^. But then 
is a factor of pxq which contradicts the fact that p and q are two consecutive 
occurrences from {u,v}. Thus, fi^i + |a;| is not forbidden in p. From lemma [3] 
it follows that Fp C {j + \x\ mod fc | j £ Fq} U {0}, whence \Fp\ < \Fq\, a 
contradiction. Consequently n — 1 and Fq = [/i.ii • ■ • : /i,m]- If fi.i > 2, then 
following the same argument as above, we conclude that \Fp\ < \Fq\, hence 
G {0, 1}. Note that, if /i.i = 0, by lemma [5] we have fc — 1 G Fq. It means 
that Fq ~ {0, 1, . . . , fc — 1} which contradicts minimality of w. Thus /i.i = 1. 
If fi,m — k — I, then by lemma [2] we obtain that £ Fq and in this case /i^i 
cannot be equal to 1. So /i,m < fc — 2. Now it remains to prove that Fp has form 
as stated in (z). Since \Fp\ > \Fq\, we have S Fp. Then by lemma[2]we obtain 
that fc — 1 is also in Fp. So there exists a position i £ Fq satisfying i + |a;| = fc — 1 
mod fc. If i < fi.m, then i + 1 + |a;| =0 mod fc, hence \Fp\ < \Fq\. Thus i = fi^m, 
\x\ = fc — /i.m — 1 mod fc and by lemma|3]we deduce Fp = {0, fc — /i,m, • ■ • , fc — !}■ 

Case 2. Let p = v, q = u. We are to show that either /i.i + |a;| is not 
forbidden in p or /14 + |.t| = mod fc, so in both cases \Fp\ < \Fq\. It holds for 

> 2 by the argument as in the previous case. Note that, /i^i ^ 0, otherwise 
by lemma [2] position k — 1 G Fq which contradicts our representation of a set of 
forbidden positions. Therefore, assume /i^i — 1. Suppose 1 + |x| = i mod fc. If 
< i < fc — 1, then by lemma [1] position i is not forbidden in p. If i = 0, then 
lemma [3] implies \Fp\ < \Fq\. If i = fc — 1, then by lemma [1] we have i G Fp only 
if a*^"^ is a prefix of x, but then p and q are not consecutive occurrences from 
{u,v}. 

Case 3. Let p — u, q = v. Suppose n > 2, and consider arbitrary positions 
fi,i < fj,i G Fq- We show that either fi^i + \x\ mod fc ^ Fp or fj^i + \x\ 
mod k ^ Fp. Arguing by contradiction, suppose both positions are forbidden 
in p. Then by lemma [T] the word x must have suffix b^~fi.^ (if |x| < fc — /^^i, 
then /j.i + |a;| mod fc ^ Fp by lemma[T]). Analogously b''"^^-^ must be the suffix 
of X, and if |x| < fc — fi^i, then fi^i + |a;| mod fc ^ Fp by lemma [T] But then by 
the same lemma fc + (fc — last letters of x are 6's. Continuing this argument 
we get that ik — fj^i last letters of x are &'s for any positive integer £. It means 
that there exists no finite word x such that both positions fi^i + \x\ mod fc and 

+ mod fc are forbidden in p. Hence n — 1, and Fq = [/i.i, • ■ • , fi,m]- Since 
|Fp| > \Fq\, by lemmas 12] and [5] there is no j such that fij + |a;| = mod fc. 
Besides fi^i + \x\ = fc — 1 mod fc for some i. If i < m, then /i.i+i + |a;| = 
mod fc, which is impossible. Thus, i = m. Moreover, |a;| = fc — 1 — fi^m mod fc. 



On Non-Complete Sets and Restivo's Conjecture 



7 



Since is always forbidden in v, we can represent Fq as {0, . . .i, j, . . . ,k — 1} for 
some 0<i<j<k — 1. Then by lemnia|3] we have Fp — {0, j — i — 1, . . . , k — 1}. 

Case 4- Let p ^ v, q ^ v. li /i.i + |a;| ^ Fp then \Fp\ < \Fq\. Otherwise by 
lemma [T] either + = mod k or fi i + = fc — 1 mod k. In the first 
case by lemma[3]we get \Fp\ < \Fq\. In the latter case by lemma[T]the word a^~'^ 
have to be the prefix of x, which contradicts the fact that p and q are consecutive 
occurrences. 

Now assume that p = v, q = u, and they overlap. If there exists i such 
that fi^i > 2 then by lemmas [1] and S] position /i.i — 2 ^ Fp, therefore \Fp\ < 
\Fq\. Note that, /i.i 0, otherwise by lemma [2] position k — 1 ^ Fq which 
contradicts our representation of a set of forbidden positions. Hence, for all i we 
have fi^i G {1,2}. It immediately implies that n = 1. li fij =2 for some j then 
by lemma m we get that fij — 2 is equal to and \Fp\ < \Fq\. Thus Fq = {1} 
and Fp = {0,fc- 1}. 

Lemma 5. Let p and q be two consecutive occurrences of u in a word w, Fp 
and Fq be the corresponding sets of forbidden positions. If Fq — {0, i, . . . , fc — 1} 
and \Fp\ = \Fq\, then Fp = Fq. 

Proof. If i = 1, then the statement of lemma obviously holds true. So we may 
assume i > 1. Let pxq be the factor of w. Then by lemma[3]we have Fp C {0, |x| 
mod k,i + \x\ mod fc, i + 1 + |x| mod fc, . . . , fc — 1 + mod fc}. By lemma [T] 
position i +1x1 mod k ^ Fp. Thus in order to have |Fp| = | | , it is necessary that 

e Fp and j + \x\ ^ mod k for alli + l < j < k. Lemma [2] implies fc — 1 G Fp. 
It means that j + |x| = fc — 1 mod k for some i + 1 < j < k.lt is straightforward 
that j — k, = fc — 1 mod k and Fp ~ {0, i, i + 1, . . . , fc — 1} = Fq. 

Lemma 6. Let p and q be consecutive occurrences of v and u respectively in a 
minimal uncompletable word w. Let Fp and Fq be the corresponding sets of for- 
bidden positions. If Fq = {0, i, . . . , fc — 1} and \Fp\ — \Fq\, then these occurrences 
overlap and Fp = {0, i — 1, . . . , fc — 2}. 

Proof. If j = 1, then trivially w is not a minimal uncompletable word, so i > 1. 
Suppose p and q do not overlap, so there exists x such that pxq is a factor of w. 
By lemma|3]we have Fp C {0, |a;| modfc,i+|a;| mod fc, i+l + |a;| modfc, ...,fc — 

1 + |a;| mod fc} and from lemma[l]it follows that either i+\x\ ^ Fp or i + \ x\ = 
mod fc. Moreover, if j + \x\ =0 mod fc for some i + I < j < k, then we 
immediately get |Fp| < \Fq\. So we may assume that j + \x\ ^0 mod fc for 
all i + 1 < j < fc. First let \x\ > k — {i + 1). Then by lemma [1] if position 
i + 1 + |a;| mod fc is forbidden in p, then either i + 1 + |a;| = mod fc or 
i + l + |a;| = fc — 1 mod fc. The first case contradicts our assumption. In the 
latter case x — a'^^^x', but this contradicts the fact that p and q are consecutive 
occurrences. Thus, both cases are impossible. So < < fc — (i + 1), but then 
we have i + l<j = fc— |a;| <fc. It means that j mod fc is a forbidden position in 
q and j + |x| =0 mod fc, which again contradicts our assumption that j + ^0 
mod fc for alH + 1 < j < fc. Therefore occurrences p and q overlap. By lemmas [1] 
andHwe get Fp = {0, i - 1, fc - 2}. 
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Lemma 7. Let p and q be two consecutive occurrences of v in a minimal un- 
completable word w, and let Fp and Fq be the corresponding sets of forbidden 
positions. If Fq — {0, i, . . . , fc — 2} then \Fp\ < \Fq\. 

Proof. Consider the factor pxq of w. By lenima|3]we get Fp C {0, |a;| mod k,i + 
\x\ mod fc, I + 1 + |a;| mod fc, . . . , fc — 2 + \x\ mod k}. We are going to show 
that |a;| mod k is not forbidden in p. From lemma [T] it trivially follows that the 
position mod fc G Fp if either |x| = mod fc or |x| = fc — 1 mod k. The 
first case contradicts minimality of w, for we would have Fp = Fq. In the latter 
case by the same lemma we conclude x — a^^'^x' , which contradicts the fact 
that p and q are consecutive occurrences. Similar arguments can be applied to 
i + |a:| mod k. Namely, if z + jxl = /c — 1 mod fc, then again x = a^^'^x' , a 
contradiction. So either i + |a;| = mod A; or i + |a:| mod A: ^ Fp. In both cases 
we have |Fp| < \Fq\. 

Lemma 8. Let p and q be consecutive occurrences of u and v respectively in a 
word w. Let Fp and Fq be the corresponding sets of forbidden positions, and let 
\Fp\ = \Fq\. If Fq = {0,1, . . . , fc - 2} with i > 1 and Fp = . . . , then 
either Fp = {0, 1, i + 2, i + 3, . . . , fc - 1} or Fp = {0, i + 1, i + 2, . . . , fc - 1}. 

Proof. Consider the factor pxq of w. By lemma|3]we get Fp C {0, mod k,i + 
\x\ mod k,i + l + \x\ mod fc, . . . , fc — 2+ mod fc}. Arguing as in case 3 of the 
theorem [3] we easily obtain that either \x\ mod fc ^ Fp or i + \x\ mod k ^ Fp. 
Suppose that position |a;| mod fc is not forbidden in Fp. Since |Fp| = \Fq\ it is 
necessary that S Fp, therefore fc — 1 is also in Fp by lemma[2l Every i < j < k—3 
satisfies j + |x| ^ fc — 1 mod fc, since otherwise we would have j + 1 + |x| = 
mod fc, which would imply |Fp| < \Fq\. Therefore, fc — 2 + |2:| = fc — 1 mod fc, 
whence \x\ = 1 mod fc and Fp = {0, i + 1, . . . , fc — 1}. Assume now that position 
i + |x| mod k ^ Fp. Following the same argument as in the previous case, we 
get fc — 1 G Fp and either |a;| = fc — 1 mod fc or fc — 2 + = fc — 1 mod fc. 
In the first case Fp = {0, i, . . . , fc — 3, fc — 1}, and it has more than one block of 
consecutive forbidden positions, which contradicts the statement of lemma. In 
the latter case \x\ = 1 mod fc and Fp = {0, 1, i + 2, . . . , fc — 1}. 

Theorem 4. The length of a minimal uncomletable word for Sk is at least 5k^ — 
17fc + 13 for fc > 4. 

Proof. Let w be an arbitrary minimal uncomletable word for 5*^. By theo- 
rem [2] the word u is a prefix of w and v is its suffix. Note that, we have 
Fu = {0, 1, . . . , fc — 1} and F„ ~ {0} in the aforementioned occurrences of u 
and V. 

If s and t are two consecutive occurrences from {u,v} such that \Fs\ > 
I Ft I, then we say that s is an increasing occurrence. Recall that by lemma [3] it 
means that |Fs| = |Ft| + 1. Since there is only one forbidden position in the 
last occurrence of w in w and fc forbidden positions in the first occurrence of 
u, there must be at least fc — 1 increasing occurrences in w. Now we are going 
to estimate the length of a factor between two consecutive such occurrences. 
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Consider a factor pxq of w such that p and q are the only increasing occurrences 
inside this factor. Note that, for an occurrence r G {u, u} in pxq, different from 
p and q (if any), \Fr\ = \Fq\. Otherwise p and q are not consecutive increasing 
occurrences. 

Let 3 < \Fq\ < \Fp\ < k. Then by theorem [3] we have p = q = u, Fq = 
{0, i, . . . , k — 1} and Fp = {0, i — 1, . . . ,k — 1}. Moreover, p and q are not the 
only occurrences from {u,v} in pxq. Assume first i > 2, i.e. \Fp\ < k. 

Suppose that there is only one occurrence r S {u,v} in pxq different from p 
and q. Then from lemma [5] it follows that r — v and since \Fr\ = \Fq\, applying 
lemma m we obtain Fr = {0,i — 1, . . . , k ~ 2}. But then since i > 2 the set Fr 
does not have the form required by theorem [3] for the condition \Fp\ > | to 
hold true. 

Assume now that ri, r2 G {u, v} are the two occurrences in pxq different from 
p and q. By the same argument as above r2 — v and Fr2 = {0, i — 1, . . . , fc — 2}. 
If ri — u, then on the one hand by theorem [3] it should be — {li 2, . . . , fc — 
i + 1}. On the other hand by lemma |8] position have to be forbidden in Fr^, 
a contradiction. If r2 — v, then by lemma [7] we have ji^rj < 1^1-2 1 : which is 
impossible. So there are at least three occurrences of words from {u, v} in pxq 
except p and q. 

Suppose that ri,r2,r3 G {u,v}. As we have already seen before — v and 
Fr^ = {0, i — 1, . . . , fc — 2}. It immediately follows from lemma [7] that r2 — u. 
Then is either {0, 1, i + i + 2, . . . , fc — 1} or {0, i, i + l, . . . , fc — 1} by lemmaEl 
The latter case contradicts minimality of w, since we would have |-F'r2l — Wq\- 
Assume ri = w, then by theorem [3] we have Fr^ = {1, 2, . . . , fc — i + 1}. Let 
fiyr2 be a factor of pxq. Note that, ^ -FVi and by lemma [T] position i + 1 + |?/| 
mod k ^ i^n- Thus ji^^i | < l-FV^ |, a contradiction. Therefore ri = u. This case is 
possible and exactly this situation takes place in the word presented in theorem[TJ 

Now we are going to estimate the length of x in this case. By lemma |6] the 
factor rs overlaps with q, so we can factorize x in the following way: either 
X = yiTiy2f'2yzb'^~^ or, if ri and r2 overlap, x — zil/^'^ a^~^ Z2b^^^ . By the proof 
of lemma [SI since Fr^ = {0, 1, i + 1, i + 2, . . . , fc — 1}, we have Ij/sI = |z2| = 1 
mod k. Let us first estimate the length of zi. Since ri and r2 overlap, lemma |4] 
implies Fr^ = {0, i, i + 1 . . . , fc — 1}. Then by theorem [3] there are fc — 1 mod fc 
letters between p and ri, thus \zi\ = fc — 1 mod fc. So in this case we have 
|x| > 4fc — 4. Now let us assume that ri and r2 are not overlapping. By theorem[3] 
the factor ri must have only one block of consecutive forbidden positions. By 
lemma [3] we have J^r-j C {0, i + 1 + 12/2! mod fc, «-|-2-|- |j/2| mod fc, . . . , fc — 1 + |y2| 
mod fc, \y2\ mod fc, I2/2I + 1 mod fc}. Note that, by lemma [1] either i + 1 + \y2\ 
mod fc ^ iVi or i + 1 + 12/2! = mod fc. So for the set F^.^ to have only one block 
of consecutive forbidden positions and the same cardinality as F^^ we must have 
either 1 + [J/2I = fc — 1 mod fcori + 2+ |y2| = l mod fc. In the first case we 
have \y2\ = fc — 2 mod fc and Fr^ = {0, i, i + 1, . . . , fc — 1}, but this is the same 
as in the case of overlapping occurrences ri and r2, so this is impossible in a 
minimal uncompletable word. In the second case we have \y2\ = k — i — 1 mod fc 
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and Fr^ = {0, 1, 2, . . . , fc — i}. Then by theorem [3] we conclude that |?;i| = i — 1 
mod k. Therefore in this case we have \x\ > 4fc — 3. 

It is not hard to see that, if there are more than 3 occurrences from {m, w} 
different from p and q in pxq^ then even if some of them overlap, the total length 
of the word x is at least Ak — 4. So we conclude that > Ak — A. Note that, 
I a; I = Ak — A for the word from theorem [TJ 

Now let i = 2, i.e. Fp = {0, 1, . . . , fc - 1} and Fq = {0, 2, 3, . . . , fc - 1}. 
Arguing as above, by lemmas [5] and |6] there is an occurrence r = v just before 
5, overlapping with q and Fr = {0, 1, . . . , fc — 2}. If this is the only occurrence 
and pxq = pyh^~^q, then by theorem [3] we have \y\ = 1 mod fc and Fp = 
{0, 1, . . . , fc — 1}, so here we have |a;| > fc + 1. More occurrences from {u, v} 
inside pxq will obviously give a longer factor x. 

The previous argument implies that any minimal uncompletable word has 
prefix pyh^^^q with p ^ q ~ u and \y\ = 1 mod fc. From the symmetry property 
observed in theorerr(2]we deduce that any minimal uncompletable word has sufhx 
q'a^~^yp' with p' = q' — v and \y\ = 1 mod fc. Clearly Fp' — {0}. To calculate 
Fqi note that, there is an occurrence r — u overlapping with q' and Fr ~ {!}. 
From theorem [3] we deduce that q' is an increasing occurrence and Fqi — {0, fc — 
1}. Let p be the next increasing occurrence. If \Fp\ = 3, then the same theorem 
implies that p — u, Fp — {0,fc — 2,fc — 1} and there are at least fc — 1 letters 
between p and q' . If \Fp\ < 2, then we would obviously require at least fc letters 
between q' and an increasing occurrence with three forbidden positions. 

Thus to increase the number of forbidden positions from 1 in the suffix v of 
w to 2 we need at least fc — 1 letters; from 2 to 3 - at least fc — 1; from ^ to ^ + 1 
for 3 < £ < fc — 2 we need at least Ak — A letters, and finally, from fc — 1 to fc 
we need fc — 1 letters. Besides we have at least fc — 1 increasing occurrences and 
the suffix w of w with only one forbidden position. Thus the length of a minimal 
uncompletable word is at least 3(fc — 1) + (4fc — 4)(fc — 4) + fc(fc — 1) + fc = 
5fc2 - 17fc + 13. 

5 Conclusion 

The series of sets Sk was found during exhaustive computational experiment. 
We searched for maximal with respect to inclusion non-complete sets among 
all the subsets of S-^; we were interested in such sets having longest possible 
minimal uncompletable word. We have found two extreme sets up to renaming 
letters and taking mirror image. Namely, S3 — [S^ \ {baa, &&a})u(Z'^ \ {aa, bb}) 
and (Z"^ \ {baa, bba}) U (Z'^ \ {ab, ba}) . Computation was based on representa- 
tion of a set S* as a flower automaton and on the fact that S is non-complete 
if and only if the corresponding non-deterministic automaton is synchronizing. 
Moreover, the set of uncompletable words coincides with the language of syn- 
chronizing words, see [T] and [3] for more details. The same task for fc = 4 
was unfeasible for a typical laptop, so the search was performed with restriction 
n S\ > 11. There is only one extreme non-complete set up to renaming let- 
ters and taking mirror image in this class. Namely, [S'^ \ {aabb, abaa, abbb}) U 
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(Z"^ \ {aba, bba, bbb}^ The length of a minimal uncompletable word for this ex- 
ample is 31 compared to 25 for the set Sk- So Sk is not optimal even for k — A. 
Thus, the lower bound 5fc^ — 17k + 13 is likely to be improved. Nevertheless, the 
most interesting question whether the tight bound is quadratic remains open. 
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